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ABSTRACT 

Data from the AEGIS, COSMOS and ECDFS surveys are combined to infer the bias and 
dark matter halo mass of moderate luminosity [Lx (2 — 10 keV) = 42.9 erg s _1 ] X-ray AGN 
at z « 1 via their cross -correlation function with galaxies. In contrast to standard cross- 
correlation function estimators, we present a method that requires spectroscopy only for the 
AGN and uses photometric redshift probability distribution functions for galaxies to deter- 
mine the projected real-space AGN/galaxy cross-correlation function. The estimated dark 
matter halo mass of X-ray AGN in the combined AEGIS, COSMOS and ECDFS fields is 
« 13/z -1 M , in agreement with previous studies at similar redshift and luminosity ranges. 
Removing from the sample the 5 per cent of the AGN associated with X-ray selected groups 
results in a reduction by about 0.5 dex in the inferred AGN dark matter halo mass. The distri- 
bution of AGN in dark matter halo mass is therefore skewed and the bulk of the population 
lives in moderate mass haloes. This result favour cold gas accretion as the main channel of 
supermassive black hole growth for most X-ray AGN. 

Key words: galaxies: active, galaxies: haloes, galaxies: Seyfert, quasars: general, black hole 
physics 



1 INTRODUCTION 

The clustering properties of Active Galactic Nuclei (AGN) are a 
powerful diagnostic of the mechanism that dominates the fuelling 
of Super-Massive Black Holes (SMBHs) across cosmic time. Cold 
gas accretion scenarios for example, predict that AGN live in Dark 
Matter Haloes (DMHs) of up to few times 10 12 h -1 M , almost in- 
dependent of accretion luminosity and redshift (M arulli et al. 2008 ; 
|Hopkins et al.|2 008 ). In contrast, if the growth of SMBHs is domi- 
nated by accretion from a hot quasi-hydrostatic halo then one might 
expect more luminous AGN in more massive DMHs (e.g. Fanidakis 
|et al.|2012| >. 

Numerous observational programmes have been initiated in 
the last few years to test those predictions by measuring the 
DMH mass of AGN as function of redshift and accretion lu- 
minosity. Large spectroscopic campaigns of powerful UV-bright 
QSOs showed that this population lives in DMHs of a few times 
lO^h^M^ over a wi de range of re dshifts (e.g. |Croom et al.|2005 



da Angela et al.||2008| |Ross et al.||2009| ). This is consistent with 



the predictions of cold gas accretion models for the fuelling of the 



SMBHs. At the same time there has been progress in the study of 
the clustering of the less luminous X-ray selected AGN. These sys- 
tems provide a more representative and less biased census of the 
active SMBH population that dominates the accretion density of 
the Universe (e.g |Aird et al.|2010| . The DMH masses of this class 
of sources are estimated to be, on average, larger than th ose of UV- 
bright QSOs, \ogM DM H = 12.5 - 13.5 (h _1 M , Cappelluti 
|et al.|2012] >. This has been interpreted as evidence against cold-gas 
accretion via major-mergers in those systems (e.g. |Allevato et al.| 
2011 ; Mountrichas & Georgakakis 2012). However, there is scat- 
ter among the individual measurements of the DMH mass of X-ray 
AGN, which makes it hard to comment on the redshift and/or ac- 
cretion luminosity dependence of their clustering properties. This is 
because both random errors, related to e.g. small sample sizes, and 
systematic uncertainties, such as sample variance, affect current de- 
terminations of the typical DMH mass of X-ray AGN. Both these 
issues are related to the strong requirement of current clustering 
estimators for spectroscopy to get robust clustering measurements. 

The real-space cross-correlation function of AGN with galax- 
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ies for example, is arguably one of the most reliable methods for 
clustering stud ies (e.g. |Coil et al.| 2009 



gakakis 2012, Krumpe, Miya ji, & Coil 



Mountrichas & Geor- 



2010[ [Krumpe, Miyaji, 



Coil, & Aceves|2012| ). Random errors are significantly suppressed 



when counting AGN/galaxy pairs and the impact of sample vari- 
ance, which is shown to affect even relatively large-area X-ray 
surveys (e.g. |Gilli et aL|2 009) is minimised. However, these fea- 
tures do not come for free. The classic method for determining 
the AGN/galaxy cross-correlation function requires extensive spec- 
troscopy for both AGN and galaxies. This limits the number of X- 
ray fields that such an analysis can be applied, as spectroscopy for 
large galaxy samples is expensive in resources. One way to address 
this limitation is to relax the requirement for spectroscopy in clus- 
tering studies. 

An important development in the last few years in this direc- 
tion has been the rapid improvement in the quantity and quality of 
photometric redshift measurements, particularly for galaxies (e.g. 
Coup on et al.|20 09). Large scale multi- waveband photometric sur- 
veys, such as the SDSS (Sloan Digital Sky Survey, Abazajian et al.| 
2009) and CFHTLS ( |Coupon et al.|2009) , have managed to con- 
trol random and systematic photometric uncertainties, which in turn 
translates to large improvements in photometric redshift estimates. 
This development has motivated methods that use photometric red- 
shifts, in combination with spectroscopy, to determine the cluster- 
ing of extragalactic populations ( [Myers et~aT. 2009 ; Hicko x~t al.| 
|20TT] >. 

In this paper we present a method similar to that of Mye rs et aT] 
(2009 ) that uses photometric redshifts only for galaxies and spec- 
troscopy for X-ray AGN to estimate the AGN/galaxy real- space 
cross-correlation function and infer the bias and DMH mass of 
AGN. It is shown that by weighing each galaxy with its photo- 
metric redshift Probability Distribution Function yields clustering 
results similar to those obtained using spectroscopy for both AGN 
and galaxies. This methodology is then applied to all X-ray fields 
with public and good quality photometric redshift estimates and ex- 
tensive spectroscopy for X-ray sources. This translates to a signifi- 
cant increase, compared to previous studies, in the size of the X-ray 
AGN sample and better control on the impact of sample variance 
on the results. 

In Section 2 the AGN and galaxy samples are presented. The 
photometric redshift based methodology for the determination of 
the AGN/galaxy projected real-space cross-correlation function is 
described in Section 3. The results are presented in Section 4 and 
are discussed in Section 5. Throughout this paper we adopt Ho = 
100 kms" 1 Mpc" 1 , On = 0.3 and Ov = 0.7 and a 8 = 0.8. Rest 
frame quantities (e.g. luminosities, dark matter halo masses) are 
parametrised by h = Ho/100, unless otherwise stated. 



2 THE DATA 

Three extragalactic survey fields are used to determine the cluster- 
ing of X-ray AGN at z ~ 1. The Extended Chandra Deep Field 
South Survey (ECDFS), the All Wavelength Extended Groth strip 
International Survey (AEGIS, [Davis et al.|2007[) and th e Cosmolog- 
ical evolution Survey (COSMOS jScoville et al.]2007] ). The choice 
of fields is motivated by the availability of (i) X-ray data, (ii) ex- 
tensive follow-up spectroscopic programs targeting specifically X- 
ray sources and (ii) deep multiwavelength imaging (UV, optical, 
infrared) for the determination of photometric redshift Probability 
Distribution Functions (PDFs) for individual galaxies. 



2.1 Optically selected galaxy sample 

The COSMOS and AEGIS fields overlap with the D2 and D3 
regions respectively, of the deep synoptic Canada-France-Hawaii 
Telescope Legacy Survey (CFHTLS). The optical photometry 
{ugriz bands) of the T0004 data release is used, which includes 
photometric redshift estimates reliable to %ab < 24 mag and pho- 
tometric redshift PDFs ( Coupon et al. 2009). Regions of unreliable 
photometry (CFHTLS catalogue parameter FLAG_TERAPIX> 1) 
because e.g. of contamination by bright stars, are masked out. In the 
analysis we only use CFHTLS optical sources classified as galaxies 
(CFHTLS parameters OBJECT and FLAG_TERAPIX equal to zero), 
with reliable photometric redshift estimates (CFHTLS parameter 
ZP_RELIABLE/ -99) and 17.5 < %ab < 24 mag. Table[l]lists the 
total number of galaxies used in the D2 and D3 CFHTLS regions. 

In the ECDFS the photometric catalogue compiled by the 
MUSYC collaboration is used (Cardamone et al. 2010). They com- 
bined Subaru narrow-band imaging with existing UBVRIzJHK 
photometry and Spitzer IRAC images to create a uniform cata- 
logue of ~ 40,000 galaxies to Rab = 25.3. Reliable photo- 
metric redshifts, including PDFs, are also estimated to that mag- 
nitude limit (Carda mone et al.|2010| ). Our analysis uses MUSYC 
optical sources classified as galaxies (MUSYC catalogue parame- 
ter STAR_FLAG< 1; Cardamone et al. 2010) and magnitudes in the 
range 17.5 < %ab < 24 mag, as for CFHTLS galaxies. Table [I] 
lists the total number of galaxies used in the MUSYC field. 



2.2 AGN samples 

X-ray AGN are selected from the AEGIS-800ks survey (AEGIS - 
XD, Nandra et al. in prep), the Chandra COSMOS survey (C- 
COSMOS, |Elvis et al.|2009| > and the ECDFS ( |Lehmer et al.|2005] ). 
The AEGIS-XD covers a total of 0.3 deg 2 in the Extended Groth 
Strip to a total depth of 800 ks by combining Chandra ACIS-I ob- 
servations carried out in AO-3, AO-6 and AO-9. The AEGIS-XD 
and the ECDFS X-ray source catalogues are generated following 
the methodology of |Laird et al](|2009|>. Fo r C-COSMOS the source 
catalogue presented by Elvis et al. (2009) is used. In all three sur- 
veys we use all X-rays sources independent of the energy band in 
which they are detected. 

The optical identification of the X-ray sources is based on 
the Likelihood Ratio method ([Sutherland & Saunders 1992) as de- 
scribed in |Georgakakis et al.| ( [2009] ). The AEGIS-XD and COS- 
MOS source lists are matched to the T0004 release optical cata- 
logue of the CFHTLS ( |Coupon et aL][2009] >. The ECDFS X-ray 
sources are cross-matched with the MUSYC photometric catalogue 
( [Cardamone et al.|2010) . 

Optical spectroscopy of X-ray sources in the AEGIS field 
is primarily from the D EEP2 ([Newman et al.|[20T2] > and DEEP3 
galaxy redshift surveys ( [Cooper et al.|2011b[ 2012 ) as well as ob- 
servations carried out at the MMT using the Hectospec fibre spec- 
trograph ( |Coil et al.|2009|). Sp ectroscopic redshifts in the ECDFS 
have been co mpiled by [ Cardamone et al.| ( |2010|). We also include 
redshifts from [Silverman et al.| ( |2010| > and |Cooper et alT] ( |2011a| ). 
Redshifts in COSMOS are from the public releases of the VI- 
MOS/zCOSMOS bright project ( [Lilly et al.|2009|> and the Magel- 
lan/IMACS observation campaigns ( Trump et al.|20 09), as well as 
the compilation of redshifts for X-ray sources presented by Brusa 
|eTaT1 ( [2gT0l >. 

In the clustering analysis we use X-ray sources with Lx(2 — 
lOkeV) > 10 41 ergs _1 and redshifts 0.7 < z < 1.4. The rest- 
frame 2-10keV X-ray luminosity is estimated assuming a power- 
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Table 1. Galaxy samples. "Full sample" refers to galaxies in the optical 
catalogues of the MUSYC, CFHTLS-D2 and D3 after filtering out masked 
regions and applying the magnitude limits discussed in the text. The "re- 
sampled" galaxy sample is selected to have redshift distribution similar to 
that of X-ray AGN. The "DEEP2-like" sample applies the photometric cri- 
teria adopted by the DEEP2 redshift survey team to select galaxies in the 
redshift range 0.7-1.4. For details see SectionHOland Appendix A. 



field 


full sample 


"resampled" 


"DEEP2-like" 






galaxies 


galaxies 


(1) 


(2) 


(3) 


(4) 


CFHTLS (D2) 


55,367 


28,150 


21,731 


CFHTLS (D3) 


58,665 


33,389 


23,060 


MUSYC 


22,730 


15,169 


9,577 



Table 2. X-ray AGN samples in the COSMOS, AEGIS-XD and ECDFS 
fields with L x (2- 10 keV) > 10 41 ergs -1 and redshifts 0.7 < z < 1.4. 



field 


No. of sources 


< z > 


< log L x > 








(ergs -1 ) 


COSMOS 


182 


0.97 


43.3 


AEGIS-XD 


158 


0.97 


42.5 


ECDF-S 


101 


1.02 


42.8 



law spectral energy distribution with index T = 1.9. The vast ma- 
jority of sources above this luminosity cut are AGN (Georgakakis 
|et al.||2011| ). X-ray sources in regions that have been masked out 
because of poor optical photometry (e.g. bright stars) are excluded 
from the analysis. Table [2] shows for each field the number of X- 
ray AGN used for clustering measurements, their mean redshift and 
average X-ray luminosity. 



3 METHODOLOGY 

3.1 AGN/galaxy Cross- Correlation Function 

In real-space, the AGN/galaxy cross-correlation function, £(r), is 
given by 



DD(r) 
DR(r) 



1, 



(1) 



where DD(r) and DR(r) are the AGN/galaxy and AGN/random 
pairs at separation r. The random point catalogue should follow the 
galaxy sample selection function, i.e. magnitude limit, field bound- 
aries, masked regions. 

The distance r can be decomposed into separations along the 
line of sight, 7r, and across the line of sight, a. If s± and S2 are the 
distances of two objects 1, 2, measured in redshift- space, and the 
angular separation between them, then a and ty are defined as 



ty = (s2 — si), along the line-of- sight, 
a = ^ 2 — 0, across the line-of- sight. 

The correlation function in redshift- space is then estimated as 

DD(a, tt) 



DR(a,n) 



1. 



(2) 
(3) 

(4) 



This simple estimator has the advantage that the random point 
source catalogue needs to account only for the selection function 
of galaxies, which is typically a spatial filter. In contrast more 
advanced clustering estimators, such as that of Landy & Szalay 
( 1993 ), require the construction of random catalogues for the X-ray 



source population as well. This might introduce systematic biases 
into the calculations because X-ray observations have variable sen- 
sitivity accross the field of view, which is challenging to quantify 
accurately. 

In the classic approach of estimating the real-space cross- 
correlation function, spectroscopic redshifts are available for the 
AGN and galaxy samples. For each AGN/galaxy pair with a, ty 
separations determined from their sky positions and redshifts, the 
DD(a, ty) is incremented by one, i.e. DD(a, ty) = DD(a, ty) + 1. 
This relation can be modified to account for uncertainties in the 
determination of the redshifts of the galaxy population. In this 
case it is assumed that the redshifts of galaxies follow continuous 
probability distribution functions (i.e. photometric redshifts), while 
those of AGN are known at a high level of accuracy (i.e. spec- 
troscopic redshifts). For a given galaxy the probability f ga i that it 
lies at separations a, ty from an AGN can be estimated from its 
PDF. DD(a, ty) is then incremented by f ga i, instead of unity, i.e. 
DD(a, ty) — DD(a, TY) + f ga i. Therefore, a galaxy may contribute 
with different probabilities, f ga i, to more than one (a, ty) bins of an 
AGN/galaxy pair. DR(a, ty) pairs are estimated following the same 
procedure. Random catalogues are generated by randomising the 
positions of galaxies taking into account the geometry and masked 
regions of each optical survey. In this approach every random point 
has attached to it the photometric redshift PDF of the galaxy from 
which it was generated. A total of three random catalogues are pro- 
duced, one per survey field. Each of them has the same size as the 
real galaxy catalogue from which it was generated. 

When the correlation function is measured in redshift- space, 
the clustering is affected at small scales by the rms velocity dis- 
persion of AGN along the line of sight and by dynamical infall of 
matter into higher density regions. To first order, only the radial 
component of £(cr, ty) is affected by redshift- space distortions. We 
can therefore remove this bias by integrating along the line of sight, 
tt, to calculate the projected cross-correlation function 



w p (a) 



Jo 



£(cr, ty)cIty. 



(5) 



The maximum scale of the integration is a trade-off between un- 
derestimating the clustering amplitude, if TY m ax is too small, and 
low signal-to-noise ratio, if TYmax is too large. The optimum TYmax 
value is determined by measuring the projected AGN/galaxy cross- 
correlation function in the combined AEGIS, ECDFS and COS- 
MOS fields (see next section for details) for different TYmax val- 
ues in the range 10 - 100/i _1 Mpc. Figure [l] shows that the am- 
plitude of the AGN/galaxy cross-correlation function saturates for 
TYmax = 40 Mpc. This is the value we adopt in the analysis. 

Assuming that the real-space AGN-galaxy cross-correlation 
function follows a power-law of the form £(r) = (r/r ) -7 , we es- 
timate the real- space cross-correlation amplitude, ro, and the slope, 
7 from the relation 



Wp(a) 



r(|)r(- 



r(i) 



(6) 



where T(x) is the Gamma function. The AGN-galaxy cross- 
clustering strength can then be expressed in terms of the rms fluc- 
tuation of the density dis tribution over a sp here with a comoving 
radius of 8 /i _1 Mpc (e.g. |Miyaji et al.|2007| > 



2 

a 8,AG 



where 



A (7) 



r 



8k- 1 Mpc 



(7) 
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Figure 1. The AGN/galaxy cross-correlation length, ro, as a function of 
TTmax, the maximum scale of the integration in equation [5] The cross- 
correlation function is measured by combining the AEGIS, ECDFS and 
COMSOS fields. The errors are jackknife. 



Mi) = 



72 



(3- 7 )(4-7)(6-7)2^' 
the AGN/galaxy bias, 6ag, can then be calculated via 

&8,AG 



bAG - 



cr 8 ( z ) ' 



(8) 



(9) 



3.2 Galaxy auto-correlation function and AGN bias 

An estimate of the galaxy bias is required to infer the AGN bias 
from the AGN/galaxy cross-correlation function. If spectroscopy is 
available for galaxies their bias can be determined from the pro- 
jected real-space galaxy auto-correlation function (e.g. equations 

HUB- 

In the absence of spectroscopy, the galaxy bias is estimated by 
deprojecting to 3-D (Limber 1953 ) their angular auto-correlation 
function, w(0). This calculation assumes a power-law form for the 
correlation function. It also requires knowledge of the redshift dis- 
tribution, N(z), of the galaxy population. The latter is determined 
by summing the photometric redshift PDFs of galaxies. In section 
4.2 we confirm that the galaxy bias inferred from w(0) is consis- 
tent with that determined from 3-D clustering analysis using large 
galaxy spectroscopic surveys like DEEP2. 

Care should be exercised in the selection of the galaxy sample 
for which the bias is estimated to infer the AGN clustering from 
the cross-correlation function. Fig. [2] shows that the redshift distri- 
bution of the overall galaxy population at a given magnitude limit 
is wide and includes a large fraction of low redshift sources, out- 
side the redshift interval of the X-ray AGN sample (i.e. 0.7-1.4). 
One expects that the AGN/galaxy cross-correlation signal is domi- 
nated by galaxies with photometric redshift PDFs that peak within 
the redshift interval of the AGN population. It is the bias of those 
galaxies that should be used to infer the clustering of AGN. Using 
the full galaxy sample at a given magnitude limit to determine the 



AGN bias yields erroneous results. Two approaches are adopted to 
account for this effect. 

The first method limits the galaxy sample by applying appro- 
priate optical colour selection criteria to exclude from the analysis 
sources outside the redshift range of interest. We adopt the colour 
selection used by the DEEP2 team to pre-select galaxies in the red- 
shift range 0.7-1.4 (Newman et al. 2012), i.e at the same redshift 
range as the X-ray AGN. Appendix A describes how the DEEP2 
colour pre-selection is applied to the CFHTLS and MUSYC pho- 
tometric catalogues. The resulting galaxy samples are referred to 
as "DEEP2-like". The total number of galaxies in those samples 
are listed in Table [T] Figure [2] plots the N(z) of the "DEEP2-like" 
galaxy sample in the CFHTLS-D2 field and demonstrates the effi- 
ciency of the DEEP2 colour cuts in eliminating sources at z < 0.7. 
The size of the "DEEP2-like" galaxy samples in the CFHTLS and 
MUSYC survey areas is presented in Table [T] This approach has 
the advantage that one has some control on the properties of the 
galaxies that go into the cross-correlation function. 

The alternative approach is similar to that presented by Hickox 
|et al.| ( [20TT| ). Assume a particular AGN i at spectroscopic redshift 
Zi and a galaxy j which contributes f ga i,i,j to AGN/galaxy pairs at 
a particular scale DD(a, tt). For the galaxy i one can estimate the 
weight Wj = f ga i,i,j, the sum of its contribution to all pairs 
with all the AGN in the sample at all scales. The quantity Wj de- 
fines the average probability that a particular galaxy participates 
in the cross-correlation function. One can weigh each galaxy with 
Wj to construct the redshift distribution of the galaxies that con- 
tribute to the cross-correlation signal. One can then randomly draw 
samples from the galaxy catalogue that follow the weighted N(z). 
The bias of those galaxies is then used to infer the AGN clustering 
from the AGN/galaxy cross-correlation function. The galaxy sam- 
ples produced by this approach are referred to as the "resampling 
method" samples. Their size in the three survey fields is shown 
in Table [I] The redshift distribution of the "resampling method" 
galaxy sample in the CFHTLS-D2 field is presented in Fig. [2] 

Equations [7] — 1~9] are used to measure the AGN/galaxy bias, 
bAG, and the galaxy bias, b g . The X-ray AGN bias, bAGN, is 



bAG^ 



b 2 AG 



(10) 



The AGN DMH mass is inferred from the bias using the methodol- 
ogy of (e.g. |Mo & W hite 1996 ). In brief, assuming the ellipsoidal 
collapse model of Sheth et al. ( 2001 ]) the bias values are co nverted 
to v = S c /a(M) (equation (23) of da Angela et al.|2008| ), where 
cr(M) is the rms fluctuation of the density field and S c w 1.69 the 
critical overdensity required for collapse. Then v is converted to 
DMH mass using equations (A8)-(A10) of |Van den Bosch] ( [2002] ). 



3.3 Combining clustering results from different fields 

The AGN-galaxy cross-correlation function and galaxy auto- 
correlation function are determined from the combined AEGIS, 
COSMOS and ECDFS fields. This is to improve the statistical reli- 
ability of the results and minimise the impact of cosmic-variance on 
the clustering measurements. The combined correlation function is 



iaiiir) = — — - 1, 



DR a ii(r) 



where 



3 

DD all {r) = Y J DD i {r). 



(11) 



(12) 



1=1 
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Figure 2. Solid line shows the redshift distribution of the full CFHTLS 
galaxy sample (55,367) in the COSMOS field. Dashed line presents the N(z) 
of the "resampling method" galaxy sample (total of 28,150; see Section 
[3T2J. The N(z) of the "DEEP2-like" galaxies (21,731; see Appendix A) is 
plotted with the dotted line. The N(z) of each galaxy population is estimated 
by summing the photometric redshift PDFs of individual sources. 
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Figure 3. The projected real-space AGN/galaxy cross-correlation function 
in the AEGIS field. Triangles are the cross-correlation function using photo- 
metric redshifts only for the galaxy population (CFHTLS-D3). Circles cor- 
respond to the cross-correlation function using only galaxies with spectro- 
scopic redshifts from the DEEP2 and DEEP3 surveys. Errors are jackknife. 
For clarity, triangles are offset in the horizontal direction by 51ogcr=+0.01. 



DDi(r) is AGN-galaxy pairs in each of the three fields. The same 
relation is used in the case of DR pairs. 

The uncertainties in the correlation function at a given scale 
are estimated using the Jackknife methodology. Each field is split 
into five equal- size subregions. This results to a total of 15 sections 
spread over the three fields of choice. The correlation function is 
determined 15 times, excluding each time one section. The jack- 
knife error is 

2 JV - 1 ^ DR L 2 
L=l 

where N is the number of sections, i.e. 15, DRl is the data-random 
pairs in each section, DR is the total number of data-random pairs, 
£l is the correlation function measured in each section and £ is 
the combined correlation function. We experimented with different 
number of Jackknife regions. For a very small number of regions 
one expects errors dominated by poisson noise. Similarly, for very 
large number of subregions the sample variance uncertainties be- 
come negligible and shot noise is again important. The estimated 
Jackknife errors normalised to the Poisson errors (excess variance) 
at a given scale increases initially with the number of Jackknife re- 
gions, reaches a plateau and then decreases. The chosen value of 15 
Jackknife regions is within the plateau. 



4 RESULTS 

4.1 Performance of the photometric redshift based 
AGN/galaxy cross-correlation method 

Before applying the methodology presented in the previous section 
to the combined AEGIS, ECDFS and COSMOS fields we verify 



that it produces similar results with the traditional approach which 
uses spectroscopic redshift for both the AGN and the galaxy pop- 
ulation. In this exercise we use data from the AEGIS field only, 
which benefits from extensive spectroscopy for galaxies as part of 
the DEEP2 and DEEP3 surveys (Cooper et d.|2011b||20l2||New^| 
|man et al.|20T2] >. 

Fig. [3] presents the cross-correlation function of 158 X-ray 
AGN with ~ 5, 100 spectroscopic DEEP2 and DEEP3 galax- 
ies with R < 24.1 mag. This is compared with the projected 
cross-correlation function estimated by replacing the spectroscopic 
galaxy sample with - 23, 000 photometric CFHTLS -D3 "DEEP2- 
like" galaxies. The estimated AGN bias is 6 a = 1.93lo'.37 an d 
bA = 2.05^q 2J, for the photo-z based and the standard method, 
respectively (power law fits are applied on scales l-10/i _1 Mpc). 
The two methods yield consistent results on the inferred bias of 
AGN. 



4.2 Determining the bias and DMH mass of X-ray AGN at 
z=l 

Fig. [4] presents the projected AGN-galaxy cross-correlation func- 
tion from the combined AEGIS, ECDFS and COSMOS fields. Two 
sets of points are plotted. One corresponds to the cross-correlation 
function with all galaxies and the other to the cross-correlation with 
the "DEEP2-like" galaxy samples. The two cross-correlation func- 
tions are not directly comparable because the galaxy samples used 
in the calculations are different. 

Assuming a power-law form for the cross-correlation function 
we use a % 2 minimization procedure to estimate the best-fit ampli- 
tude, ro, and the slope, 7, at scales 1 — 10/i _1 Mpc, in the linear 
regime of clustering. The results are shown in Table [5] The fits use 
only the diagonal elements of the covariance matrix. This does not 
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affect the best fit values, but may bias the inferred minimum re- 
duced x 2 to values lower than unity. The errors correspond to the 
68th percentile around the minimum % 2 . Non-linearities m ay af- 
fect the correlation function on scales larger than l/i -1 Mpc (Ross 
et al.|201l| >. Chan ging our fitting scales to 0.5 — lOh 1 Mpc and 
3 - 10/i _i Mpc, the inferred AGN bias is stable at the 10% level 
consistent within the errors with the values estimated at scales 1- 
lOMpc. 

Table [3] also presents the AGN-galaxy bias, &ag, which has 
been calculated using the best-fit ro, 7 and equations |7| -|~9| The 
effective redshift distribution for the cross-correlation function in 
each of the three fields is estimated by the convolution of the red- 
shift distributions of the AGN and galaxy samples, i.e. Nag(z) = 
Nagn(z) x Ngai (z). The overall redshift distribution is calculated 
in a similar manner, Nf G (z) = N^ GIS (z) x N%g SMOS (z) x 

AT ECDFS ( \ 
^ AG { Z J- 

Figure [5] presents the measurements for the galaxy angular 
auto-correlation function for the "resampling method" galaxy sam- 
ple and the "DEEP2-like" galaxies. In fields of finite size the an- 
gular auto-correlation function estimation is biased to lower val- 
ues (integral constraint, |Groth & Peebles|1977| ). The effect of this 
bias becomes important on scales comparable to the survey size 
(~ 0.5 — 1 degrees in our case). Scales in the range 0.025° < 
< 0.2° are used to fit the galaxy auto-correlation function. At 
these scales the integral constraint is negligible. Also, these angu- 
lar separations correspond to about 1 — 10 h _1 Mpc at z ~ 1, i.e. 
the scales over which the AGN/galaxy cross-correlation function 
is measured. Following the procedure described in Section |T2] the 
galaxy bias is estimated. The results are listed in Table [3] 

As a check, we use spectroscopically confirmed galaxies in 
the AEGIS-XD field (DEEP2 and DEEP3 surveys) to determine 
the projected real- space auto-correlation function. This calculation 
yields 63 = 1.78±o.ii consistent with the bias of the "DEEP2-like" 
galaxy sample inferred from the angular auto-correlation function, 
b g = 1.62 ± 0.08 (see Table[3}. 

Combining the AGN/galaxy projected cross-correlation func- 
tion with the bias of galaxies inferred from their angular auto- 
correlation function yields the AGN bias and DMH masses listed 
in Table [3] The cross-correlation functions using the "resampling 
method" and the "DEEP2-like" galaxy samples yield bAGN and 
DMH masses that are consistent within the statistical uncertainties. 



4.3 The effect of groups in the clustering signal 

In addition to the average dark matter halo mass of AGN, it is also 
desirable to have information on their distribution in halo mass. 
Georgakakis et al. (2008 ) for example found that at z ~ 1 the frac- 
tion of X-ay AGN relative to galaxies is similar in groups and in 
the field, suggesting diverse environments for the AGN population. 
Recent results on the halo occupation of moderate luminosity X- 
ray AGN are also consistent with a wide range of halo masses for 
these sources ( [Miyaji et aT]|2011| |Allevato et~aL]|20121 ). The sig- 
nal to noise ratio of our clustering signal is insufficient for halo 
occupation analysis. We can nevertheless get a handle on the dark 
matter halos mass distribution of X-ray AGN by investigating how 
the clustering signal changes once sources in the most massive dark 
matter haloes within the surveyed area, i.e. groups, are removed. 

Groups are identified via their diffuse X-ray emission in 
the Chandra data of the ECDFS (Finoguenov et al. in prep) and 
AEGIS-XD (Erfanianfar in prep). In the c ase of C-COSMOS both 
the Chandra and the XMM observations (Cappell uti et al.|| 2009) 
in that field are used to identify groups ( |Leauthaud et al.||2010) . 



The adopted methodology for detecting diffuse X-ray sources is 
described in Finogu enov et al.|f2007| ) and |Leauthaud et ah]p010| ). 
Group optical counterparts and redshifts are estimated by search- 
ing for red-sequence galaxy overdensities within 0.5 Mpc (physi- 
cal) off the X-ray centroid. The identification of group members 
uses the Bayesian approach of |George et al.|p011| >, which esti- 
mates the probability that a galaxy belongs to a group given their 
projected separation, the redshifts of the galaxy and group (includ- 
ing photometric redshift information) and the background density 
of field galaxies. The groups identified in the COSMOS, AEGIS- 
XD and ECDFS have masses > 2 x 10 13 M in the redshift range 
0.7-1.4. The X-ray group catalogue is by no means complete. It is 
affected for example, by the variable X-ray sensitivity across the 
surveyed area, the different depths of the three X-ray fields used in 
the analysis and scatter in the relation between DMH mass and X- 
ray luminosity. Nevertheless, using X-ray selected groups allows 
us to explore, to the first approximation, the impact of the most 
massive structures within the surveyed area on the AGN clustering 
signal. 

A total of 22 X-ray AGN in the sample (5%, 22/441) are 
associated with group members, 6 AGN in the ECDFS, 7 in 
C-COSMOS and 9 in AEGIS-XD. We re-estimate the cross- 
correlation function after removing this small number of sources 
from the X-ray AGN catalogue as well as optically selected galax- 
ies within 0.5 Mpc from the X-ray centre of groups. This has a 
strong impact on the inferred DMH mass. The results are shown 
in Table|] A DMH mass of 12.68lo.4o is estimated, about 0.5 dex 
lower than the full AGN sample. The bulk of the X-ray AGN popu- 
lation at z ~ 1 lives in moderate size dark matter haloes. It should 
be emphasised that the observed decrease in the clustering signal is 
related to the way AGN are distributed in halo mass. It is not be- 
cause of serendipitous massive structures within the surveyed area 
that bias the inferred DMH mass to high values. The impact of the 
latter effect on our results is small because of the combination of 
different fields and the cross-correlation approach we adopt. 



4.4 Clustering dependence on luminosity 

Finally we investigate the dependence of the clustering signal on 
X-ray luminosity, Lx- To minimise possible aliases between the 
redshift and luminosity we first split the AGN sample at the me- 
dian redshift, z = 0.97. The X-ray sources in each redshift bin are 
further separated into two nearly equal size low and high X-ray lu- 
minosity subsamples. They are presented in Table [4] Because the 
number of sources in each subsample is low, we explore differences 
in the clustering using the relative bias 



w p (a)high-L x /w p (cr)iou 



(14) 



where w p (a)hi g h-L x , w p (a)i ow -L x are the projected cross- 
correlation functions for the high and low luminosity subsam- 
ples, respectively. In this calculation we use scales in the range 
1 — 1 0h _1 Mpc. The low-z and high-z subsamples yield b re i — 
1.25 =b 0.23 and 1.52 ± 0.35, respectively. The uncertainties are 
too large to comment on the luminosity dependence of the AGN 
clustering in the present sample. 
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Table 3. Power-law best -fit parameters ro , 7 for the AGN/galaxy cross-correlation function and galaxy angular auto-correlation function using the methods 
discussed in Section [3T2] The x 2 fits are using only the diagonal elements of the covariance matrix, xt present the reduced x 2 values. The inferred biases are 
also listed. In the case of the cross-correlation function the fit is performed at scales 1 < r < 10/i _1 Mpc. For the galaxy auto-correlation function the fit is 
on scales 0.025° < < 0.2°. The inferred AGN bias and DMH mass are also presented. 

H) 7 xl b A G r 7 xl b 9 b A GN logM DM H 

(h~ 1 Mpc) (dof) (h~ 1 Mpc) (dof) (jr%) 

Resampled galaxies 5.15±g;|2 1 - 80 -o.il 034 ( 7 ) L85 to.i4 4.9 ± 0.3 1.70 ± 0.06 2.05 (8) 1.75 ± 0.08 1.96+°;^ 12.9ll°; 22 

DEEP2-like galaxies 5 - 6 °to".50 1 - 72 io.i8 022 ( 7 ) L96 ± °- 15 4 - 5 ± °- 3 L60 ± °- 08 °- 96 ( 8 ) L62 ± °- 08 2 - 38 ± °- 38 13 - 2 °io.27 

AGN in X-ray groups excluded 4.46^°;^ 1.72±g-J| 0.17(7) 1.61±° ^| 4.3 ± 0.3 1.55 ± 0.07 0.87 (8) 1.62 ± 0.08 1.71+°;^ 12.68i°; 2 ^ 



Table 4. The number of X-ray AGN in z — Lx subsamples. 



Field 


0.7 < z < 0.97 
41.0 < L x < 42.7 


0.7 < z < 0.97 
L x > 42.7 


0.97 < z < 1.4 
41.0 < L x < 43.2 


0.97 <z<1.4 
L x > 43.2 


COSMOS 


25 


69 


22 


66 


AEGIS 


58 


23 


56 


21 


ECDFS 


28 


17 


32 


24 


Total 


111 


109 


110 


111 



10 



*x = cor with DEEP2 like galaxies 
ix-cor with full galaxy samples 



a(h" 1 Mpc) 

Figure 4. AGN-galaxy projected cross-correlation function from the com- 
bined AEGIS, COSMOS and ECDFS fields. Filled triangles present the re- 
sults using the "DEEP2-like" galaxy sample. Open triangles show the re- 
sults using the full galaxy samples. For clarity, filled triangles are offset in 
the horizontal direction by 5\oga=+0.0l. 



0.1 



0.001 



it 




*DEEP2 like galaxies 
1 resampling 



S(degrees) 



Figure 5. Galaxy angular auto-correlation function from the combined 
AEGIS, COSMOS and ECDFS fields. Filled triangles are the results us- 
ing the "DEEP2-like" galaxy sample. Open triangles show the results using 
the "resampling method" galaxy sample. Open triangles are offset by -0.02 
in the horizontal axis for clarity. 



5 DISCUSSION 

5.1 The clustering estimation method 

A method is presented to estimate the projected real-space cross- 
correlation function between X-ray AGN and galaxies that requires 
spectroscopy only for the AGN population and uses photometric 
redshift probability distribution functions for galaxies. The bias of 
AGN is inferred by estimating the clustering properties of the tracer 
population, in our case galaxies, from their angular auto-correlation 
function and (photometric) redshift distribution, N(z). A potential 



source of systematic error at this step is that the tracer population 
may have a very different N(z) from the sources of interest, for 
which one wishes to estimate the bias. In our case, galaxies have 
a much wider N(z), that includes many low redshift sources, com- 
pared to X-ray AGN. Hickox et al. ( 201 1 ) propose the resampling 
approach, also used here, to account for this effect. A downside of 
this methodology is that there is little control on the selection of 
the galaxies that enter the auto-correlation function determination. 
These are chosen in a probabilistic manner based on how much they 
contribute to the cross-correlation clustering signal with AGN. 
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Table 5. Clustering measurements for X-ray AGN from the literature. Columns are: (1) reference to the AGN sample; (2) name of the survey that the AGN 
sample was selected from; (3) number of sources used; (4) median redshift of the sample; (5) median X-ray luminosity of the sample; (6) the AGN bias values 
re-calculated using the real-space power law fits quoted in the studies and equations [7| -|"9] 



Study 


Sample 


no 
of objects 


z 


Lx 
(ergs -1 ) 


b 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


This Work (DEEP2-like) 


COSMOS/AEGIS-XD/ECDFS 


441 


0.97 


42.9 


2.38 ±0.38 


This Work (resampling) 


COSMOS/AEGIS-XD/ECDFS 


441 


0.97 


42.9 


± - yD -0.31 


Mountrichas & Georgakakis (2012) 


XMM/SDSS 


297 


0.10 


42.1 


-1 9 o+0.12 
i - zo -0.17 


Coil et al. (2009) 


AEGIS 


113 


0.90 


43.2 


1.97 ±0.25 


Gilli et al. (2005) 


CDFN 


160 


0.96 


43.3 




Gilli et al. (2005) 


CDFS 


97 


0.84 


43.5 


2.64 ±0.30 


Gilli et al. (2009) 


COSMOS 


538 


0.98 


43.4 




Yang et al. (2006) 


CDFN 


252 


0.8 


42.6 


-1 77 +0.20 
L - ' ' -0.30 


Yang et al. (2006) 


CLASXS 


233 


1.2 


43.8 


3.58+;-;; 


Starikova et al. (2011) 


Bootes 


1282* 


0.37 


42.7 


1 55+0:15 


Starikova et al. (2011) 


Bootes 


1282* 


0.74 


43.4 


Z - OO -0.31 


Starikova et al. (2011) 


Bootes 


1282* 


1.28 


44.0 


2 93 +0 - 43 

^• yo -0.43 


Cappelluti et al. (2010) 


BAT 


199 


0.045 


43.5 


-1 99 +0.09 
-0.08 


Allevato et al. (2011) 


COSMOS 


593 


1.22 




2-80lg;2g 


Krumpe et al. (2010) 


RASS 


1552 


0.27 


43.4 


l 11+0.10 


Krumpe et al. (2012) 


RASS 


629 


0.13 


42.8 


1 19+ - 08 

- L -- Ly -0.09 


Krumpe et al. (2012) 


RASS 


1552 


0.25 


43.4 


1 06 +0 - 09 
J - UD -o.n 


Krumpe et al. (2012) 


RASS 


876 


0.42 


43.8 


96+ 22 



*This number corresponds to the total number of sources in redshift range 0.17 < z < 4.5. 



The resampling approach, and any uncertainties that this may 
introduce in the results, can be avoided, if the galaxies are selected 
to have a relatively narrow N(z) that matches that of X-ray AGN. 
An example of this approach is the application of the DEEP2 colour 
cuts to photometrically pre- select galaxies in the redshift interval 
~ 0.7 — 1.4. Resampling is not needed in this case. We demon- 
strate this point by re-estimating the AGN bias after applying the 
resampling methodology to the "DEEP2-like" galaxy samples of 



npl] 

Table nl This exercise yields an AGN bias of 6 a = 2.23 
good agreement with the bias estimated using the full DEEP2-like 
samples, 6 a = 2.38 ± 0.38 (see Table|3}. 

Next we use LRGs (Luminous Red Galaxies) to further 
demonstrate that a tracer population with a narrow redshift dis- 
tribution is well suited to the clustering estimation method based 
on photometric redshifts. Because of their optical SEDs which is 
dominated by old stars and is characterised by a prominent 4000 A 
break, LRGs can be photometrically selected in well defined and 
relatively thin redshift slices (Eisenstein et al. 2001). We choose to 
apply our methodology to UV-selected QSOs and LRGs selected 
in the 2SLAQ ( 2dF-SDSS LRGs and QSO; |Cannon et al.1|2006| 
|Ross et al.|2007| ) survey to estimate the QSO/LRG cross-correlation 
function and then infer the bias of the QSOs in that survey. We 
refer the reader to Appendix B for details on the 2SLAQ survey. 
In short, we use 448 spectroscopically identified UV-bright QSOs 
and « 10 000 LRGs photometrically selected by 2SLAQ at red- 
shifts 0.35 - 0.75. We associate the « 10 000 2SLAQ LRGs with 
the photometric redshift PDFs estimated by Cunha et al. (2009) 
based on the Sloan Digital Sky Survey (SDSS) DR7 photometry. 
The QSO/LRG real space projected cross-correlation function is 
estimated using the LRG photometric redshift PDFs. We then es- 
timate the bias of the tracer population using either the full LRG 
sample or by applying the resampling method of section Both 
approaches give consistent results for the inferred bias of 2SLAQ 
QSOs, b Q = 1.72±° f 4 and b Q = 1.58±g;|g, respectively (see Ap- 



+0.48 



pendix B for details). For tracer populations with relatively narrow 
redshift distributions there is no need to follow the resampling ap- 
proach to infer the clustering of the target population (i.e. QSOs). 



5.2 The clustering properties of X-ray AGN 

The AGN/galaxy cross-correlation method of section|3]is applied to 
the combined AEGIS, COSMOS and ECDFS fields. The inferred 
bias of X-ray AGN is compared with previous measurements in 
Figure [6 The literature bias datapoints in that figure are presented 
in Table 5] They are re-estimated as explained in Mountrichas & 
Georgakakis (2012). Our bias determinations are consistent with 
previous studies at similar Lx and z intervals that control the im- 
pact of sample variance either by combining data from different 
surveys or by using the AGN/galaxy cross-correlation function. In- 
deed some the highest bias values of X-ray AGN at z ~ 1 in Figure 
[6] are because of serendipitous cosmic structures within the sur- 
veyed area (e.g. |Gilli et al.|2005||2009) . 

The picture that emerges from Figure [6] is that moderate lu- 
minosity X-ray AGN reside in DMHs with mean mass of about 
1O 13 /i _1 M from z w to z — 1. However, we also find evi- 
dence that this average is skewed to high values as a result of the 
distribution of AGN in halo mass. Excluding the 5 per cent of the 
X-ray AGN in the sample associated with X-ray selected groups, 
reduces the inferred DMH mass of the remaining population by 
« 60% to « 4 x 10 12 /i _1 M . It is emphasised that this is not be- 
cause of sample variance, in the sense that overdensities within the 
surveyed area affect the clustering estimation. It is rather a conse- 
quence of the way AGN are distributed in DMH mass. Our analysis 
therefore suggests that a large fraction of the moderate luminosity 
X-ray AGN at z ~ 1 live in DMHs with masse s similar to those of 
UV bright QSOs, few times IP 12 /! -1 M (e.g. |Croom et al.|2005 



|da Angela et al.|2008[|Ross et al.|20"09] >. This DMH mass scale is 



consistent with the predictions of models which invoke cold gas 
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Figure 6. The inferred bias of AGN as a function of redshift. Our measurements using both the "DEEP2-like" and the "resampling method" galaxy samples, 
are plotted with the filled circles. The AGN bias estimation based on the "resampling method" galaxy sample is offset in the horizontal axis by Sz = +0.02 
for clarity. The points have been colour coded based on the median X-ray luminosity of each sample. The dotted lines present the expected b(z) of DMHs. 



accretion via either mergers or disk instabilities to build SMBHs at 
the centres of galaxies (e.g Hopkins et al. 2007 ; Bo noli et aT7| 2QQ9 ; 
Fanidakis et al. 201 2j. In contrast, if SMBHs grow their mass via 
accretion of gas from a host quasi-static atmosphe re ([Croton et al. 
2006 e.g.) much larger DMH mass are expected (Fanidakis et al. 
2012 ). Our analysis therefore favours cold gas accretion as the main 
channel of SMBH growth for X-ray AGN. 



6 CONCLUSIONS 

A method is presented to estimate the bias of any extragalactic pop- 
ulation, for which spectroscopy is available, via the estimation of 
their real space cross-correlation function with a tracer population, 



for which only photometric redshift PDFs are available. We argue 
that this method works best when the tracer population has a nar- 
row redshift distribution that extends over the same range as the 
sources for which the clustering needs to be estimated. 

The method is applied to moderate luminosity [Lx(2 — 
10 keV) w 10 43 erg s -1 ] X-ray AGN at z w 1 selected in AEGIS, 
COSMOS and ECDFS. The tracer population are optically selected 
galaxies in those fields. DMH masses of 10 13 h" 1 M are esti- 
mated for the X-ray AGN samples, in good agreement with previ- 
ous studies at similar redshift and luminosity intervals. 

We also find evidence that the distibution of AGN in dark mat- 
ter halo mass in skewed. After excluding 5% of the AGN in the 
sample associated with X-ray groups, we estimate a 0.5 dex lower 
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DMH mass, log M « 12.5 (h' 1 M©). Therefore, the bulk of the 
X-ray AGN live in environments similar to those predicted by cold 
gas accretion models for the growth of SMBHs, e.g. gaseous major 
galaxy mergers or disk instabilities. 
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APPENDIX A: COLOUR TRANSFORMATIONS TO THE 
DEEP2 BRI FILTERSET 

The analysis presented in this paper exploits the fact that simple 
colour cuts provide an efficient way of selecting galaxies in well 
defined redshift intervals. We choose to study AGN in the redshift 
range 0.7 — 1.4 and therefore adopt the DEEP2 survey photomet- 
ric criteria for redshift pre- selection (Newman et al. 2012 ). They 
showed that galaxies below and above z — 0.7 separate well in the 
B — R vs R — I colour space, thereby yielding nearly complete 
samples of galaxies at z > 0.7. 

Applying the DEEP2 colour cuts to the MUSYC and CFHTLS 
photometric catalogues requires the determination of transforma- 
tions between the filtersets of those surveys and the DEEP2 BRI 
photometric bands. 
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Al The MUSYC dataset 

In the ECDFS we use the MUSYC Subaru vl.O catalog which pro- 
vides photometry in 32 bands from the UV to the mid-infrared 
(Cardamo ne et al.|2010|) . For the DEEP2 we use the photometric 
catalogue of |Coil et al.| ( |2004| ). As there is no overlap between 
the MUSYC and any of the DEEP2 fields, the colour transfor- 
mations between the filtersets in the two photometric surveys are 
determined using theoretical stellar tracks. The LePHARE code 
( [Arnouts et~aT]|19991 |Ilbert et al.1|2006] > is used to convolve the 
DEEP2 and MUSYC B, R and I filters with the pickles | ( fT998) > 
stellar templates and predict the B — R and R — I colours of stars 
in the two filtersets. In Figure [AT] these stellar tracks are compared 
with the photometry of optically unresolved sources (i.e. stellar 
like) in each of the two surveys. In the DEEP2 catalogue these are 
defined as objects with Pgal < 0.2 (probability that the source is 
resolved, see |Coil et al.|2004| for details) and 19 < Rr>2 < 23 mag, 
where Rr>2 is the optical magnitude measured in the DEEP2 R- 
band filter. In the case of the MUSYC catalogue stars are selected 
by requiring that the parameter STAR_FLAG equals unity (see Car- 
|damone et al.|2010| for details) and 19 < Rm < 23 mag, where Rm 
is the optical magnitude measured in the MUSYC i?-band filter. 
Figure |A1| shows that the photometric calibration of the MUSYC 
and DEEP2 surveys is offset from the theoretical stellar tracks by 
about +0.05 and +0.15 mag respectively in the R—I colour. These 
systematic offsets should be taken into account when converting 
from the MUSYC to the DEEP2 filterset. The source of these off- 
sets is beyond the scope of the analysis presented here. 

Next we compare the MUSYC and DEEP2 filtersets by plot- 
the (B 



A2 



ting in Figure 
(R — J)m against the (R- 



R)m versus the (B — R)r>2 and the 



J)d2 colours of the |Pickles| ( |1998| ) stel- 
lar templates. In the notation above the subscripts M and D2 denote 
the MUSYC and DEEP2 bands respectively. For simplicity, linear 
relations are fit to the stellar tracks in Figure [A2| to transform the 
(B — R)m and (R — I)m colours to (B — R)r>2 and (R — 7)d2 
respectively. Fitting higher order polynomials to the data points in 
Figure [A2| does not change the results. The best-fit linear relations 
are 



(B - R)r> 2 = 0.05 + 1.2 (B - R) M , 
(R - j) D2 = 0.007 + 0.8 (R - J) M . 



(Al) 
(A2) 



Figure|A3|plots R M -Rr>2 against (R-I) M for the |Pickles| |l998) 
stellar templates. It shows that Rm — Rr>2 is well correlated with 
the (R — I)m colour up to (R — I)m ~ 0.5. At redder colours the 
relation appears to flatten and the scatter in the theoretical stellar 
locus increases. We choose to fit a linear relation to the datapoints 
up to (R — I)m = 0.5. At redder colours, the Rm — Rr>2 is ap- 
proximated with a constant fixed to the value of the linear relation 
at (R — I)m = 0.5. Although this approximation may lead to sys- 
tematic uncertainties in the conversion from Rm to i?D2, these are 
expected to be at the level of few tenths of a magnitude. The func- 
tional form used to describe the variations of the Rm — Rr>2 with 
(R - I) M is 



Rr>2—R]\ 



0.078 + 0.005 (R - J) M , (R - I)m < 0.5, 




-0.5 



0.5 1 1.5 2 
(R-I) u (mag) 



Figure A3. Rm — Rr>2 against (R — I)m colour. The red triangles are the 
Pickles (1998) stellar templates. The black line shows the relation used to 
approximate the theoretical stellar track. 



|A2[|A3| The number counts in the Rd2 band are then constructed 
for MUSYC sources classified as galaxies (STAR_FLAG parameter 
equals null; see Cardamone et al. 2010 for details). These are then 
compared with the it^-band number counts of DEEP2 galaxies 
(extended source probability parameter Pgal > 0.2; see Coil et al. 
2004 for details). The galaxy counts normalised to the area of each 
photometric survey are plotted in Figure |A4| There is a systematic 
offset of about 0.2 mag between the two photometric catalogues. 
Similar offsets are also apparent in the Bd2 and Id2 galaxy counts. 
This suggests that the discrepancy is because of differences in the 
determination of total magnitudes in the two surveys and is not re- 
lated to the colour transformation between the two filtersets. This 

-R)D2 



A5 



which compares the (B 



is further supported by Figure , 
and (R - I) D 2 distributions of the MUSYC and DEEP2 surveys. 
These are constructed by selecting galaxies in the two surveys with 
18 < Rd2 < 24 mag. The faint magnitude limit is chosen to min- 
imise the effect of incompleteness in the shallower DEEP2 photo- 
metric survey. In the case of the MUSYC catalogue an offset of 
+0.2 mag is applied to the estimated Rd2 to account for the sys- 
tematic offset in the number counts of the two surveys. In the case 
of the (R — I)d2 colour distribution the offsets between the theo- 
retical stellar tracks and the MUSYC/DEEP2 photometry (see Fig. 
|A1| ) have also been applied to the data. The agreement between the 
MUSYC and DEEP2 colour distributions in Figure |A5| indicates 
that the colour transformations of equations | A 1 [ | A2| are robust and 
can be used to convert the fluxes measured through the MUSYC 
filterset to the DEEP2 BRI bands. 



0.04, 



(R-I)m > 0.5. 



As a consistency check of the colour transformations |A1[ |A2[ |A3| 
we compare the magnitude and colour distributions of galaxies in 
the DEEP2 and MUSYC surveys. For each source in the MUSYC 
photometric catalogue we estimate its (B — R)r>2, (R — I)d2 
colours and Rd2 magnitude by applying the linear relations [AT] 



(A3) A2 The CFHTLS dataset 

The CFHTLS D3 field overlaps with the DEEP2 survey in the 
Extended Groth Strip (EGS). It is therefore possible to determine 
the transformations from the CFHTLS ugriz bands to the DEEP2 
BRI filterset using sources that are common in the two surveys. 
The data release T0004 of the CFHTLS photometric catalogue 
( [Coupon et al.| 2009) is cross-matched with the DEEP2 source list 
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Figure Al. B — R\s R — I stellar locus. The panel on the left is for the MUSYC filterset (subscript "M" in B — R and R — I colours). The right panel 
is for the DEEP2 bands (subscript "D2"). In both panels the black dots are optically unresolved sources detected in the MUSYC (left) and DEEP2 (right) 
photometric surveys. The red triangles are the theoretical stellar tracks determined by convolving the Pickles (1998) stellar template library with the MUSYC 
(left panel) and DEEP2 (right panel) BRI filters. 
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(B-R) u (mag) (R-I) M (mag) 



Figure A2. Left panel: (B — R)m colour estimated through the MUSYC bands against (B — R)r>2 colour determined through the DEEP2 filters. The 
red triangles are the Pickles (1998) stellar templates. The black line shows the best-fit linear relation to the theoretical stellar track. Right panel: (R — I)m 
colour estimated through the MUSYC bands against (R — 7)d2 colour determined through the DEEP2 filters. The red triangles are the Pickles (1998) stellar 
templates. The black line shows the best-fit linear relation to the theoretical stellar track. 



( |Coil et al.|2004| ) using a search radius of 2 arcsec. Regions of poor 
photometry because of nearby bright stars or diffraction spikes are 
masked out in this exercise. 

The transformations of the g — r and r — i colours to 
(B — R)r>2 and (R — 7)d2 respectively are determined using opti- 
cally resolved sources in the deeper CFHTLS catalogue (parameter 
FLAG_TERAPIX equals null, |Coupon et al.|2009] > with 18 < R D 2 < 
24 mag. We choose to use galaxies to determine the transformation 
between colours in the two filtersets to have a handle on the scatter 



expected in those relations because of photometric uncertainties, 
the different approaches for determining colours in the two surveys 
as well as the diversity of the Spectral Energy Distributions of ex- 
tragalactic sources. Figure |A6] compares the CFHTLS and DEEP2 
filtersets by plotting for galaxies in the overlap region of the two 
surveys the (B — R)r>2 colour versus g — r and the (R — 7)d2 
colour against r — i. The mean (B — R)r>2, [(R — ^)d2] and its 
standard deviation within g — r (r — i) colour bins of 0.1 mag size 
are also shown in Figure |A6| A second order polynomial is fit to 
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Figure A4. Galaxy number counts in 0.1 mag bins for the i?D2 filter. The 
red curves correspond to the 1 sigma rms envelope of the galaxy counts in 
the DEEP2 photometric survey of the Extended Groth Strip. These observa- 
tions become incomplete fainter than R w 24 mag resulting in the observed 
turnover at faint magnitudes. The blue dots are for the MUSYC photomet- 
ric catalogue, in which case Rm is converted to Rd2 using equation [A3] 
The number counts are normalised to the area of each survey, 0.33 deg 2 in 
the case of MUSYC and 1 . 18 deg 2 for the DEEP2 survey of the Extended 
Groth Strip. There is systematic offset of about 0.2 mag between the two 
surveys. 



the binned datapoints in the (R — I)r>2 versus r — i plot, while a 
linear relation is used for the (B — R)r>2 against g — r colour plot. 

(B - R)b2 = 0.04 + 1.63 (g - r), (A4) 

(R - 7)d2 = 0.054 + 0.84 (r - i) + 0.11 (r - if. (A5) 

The conversion from the CFHTLS r-band to the DEEP2 Rr>2 uses 
optically unresolved sources (i.e. stars). This is because in this case 
one needs objects with accurate photometry that is least affected by 
systematic s related to e.g. the extent of the source or the determina- 
tion of aperture corrections. We select optically unresolved sources 
in the CFHTLS (parameter FLAG_TERAPIX equals unity, Coupon 
|et al.|2009] > with DEEP2 counterparts and magnitudes in the range 
18 < Rd2 < 23 mag. We choose not to use fainter sources to keep 
photometric errors small and also to avoid contamination by galax- 
ies at fainter magnitudes. Figure |A7| plots Rr> 2 — r against r — i. 
Also shown are the average and standard deviation of Rr> 2 — r 
within r — i colour bins of 0.2 mag in size. Figure [AT] shows the 
best-fit linear relation to the data 



RB2 



-0.01- 0.19 (r 



(A6) 



The r-band magnitude of each galaxy in the CFHTLS D3 field is 
converted to Rr> 2 and the corresponding galaxy number counts in 
that filter are constructed. These are then compared with the galaxy 
number counts estimated directly from the DEEP2 photometric sur- 
vey of the EGS field in Figure |A8] There is no evidence for system- 
atic offsets in the two distributions. 

We also investigate the accuracy of the transformations |A4[ 
A5 by comparing the (B — R)r> 2 and (R — I)d2 distributions of 
galaxies inferred from the CFHTLS gri photometry with those es- 
timate directly from the DEEP2 photometric survey in the EGS. 




0.5 1 1.5 2 
(R-I)w (mag) 



Figure A5. (B — R)y>2 and (R — I) e>2 colour distributions in 0.40 mag 
bins of galaxies selected in the MUSYC (blue solid line) and DEEP2 (red 
hatched histogram) surveys. For MUSYC sources the colours are estimated 
using the colour transformations [AT||A2| The colour distribution is limited 
to galaxies in the two surveys with 18 < Rd2 < 24 mag. 



In this exercise we also take into account the scatter of datapoints 
around the best- fit relations in Figure |A6| The rms scatter of the 
(B — R)t>2 versus g — r relation is estimated to be 0.15 mag. For 
the (R — I)r>2 versus r — i plot we determine an rms scatter of 
0.1 dex. These numbers include photometric errors, differences in 
the determination of colours in the two surveys as well as any in- 
trinsic scatter in those relations associated with e.g. the diversity 
of the galaxy SEDs. For each CFHTLS galaxy we apply equations 
A4}|A5]to infer its (B — R)r> 2 and (R-I)r>2. We then added Gaus- 



sian deviates to those colours with Half Width Half Maximum of 
0.15 and 0.1 dex respectively. The resulting colour distributions are 
compared with those determined for galaxies in the DEEP2 photo- 
metric survey in Figure |A9| This comparison is only for resolved 
sources in the two surveys with 18 < i?D2 < 24 mag. There is fair 
agreement between the CFHTLS and DEEP2 colour distributions 
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Figure A6. Left: (B — R)r>2 plotted against g — r colour. The black dots are resolved sources in the D3 region of the CFHTLS (parameter flag_terapix 
equals null, Coupon et al. 2009) with DEEP2 photometric survey counterparts and magnitudes in the range 18 < Rd2 < 24 mag. The mean (B — -R)d2 
and 1 sigma rms within g — r bins (0.1 mag size) are shown with the red circles and errorbars. The red line is the best-fit linear relation to those datapoints. 
The rms scatter around this line is estimated to be about 0.15 mag. Right: (R — 7)d2 plotted against r — i colour. The black dots are resolved sources in the 
D3 region of the CFHTLS with DEEP2 counterparts in the magnitude range 18 < Rr>2 < 24 mag. The mean (R — 7)d2 and 1 sigma rms within r — i bins 
(0.1 mag size) are shown with the red circles and errorbars. The red curve is the best-fit second order polynomial to those datapoints. The rms scatter around 
this line is estimated to be about 0.1 mag. 
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0.5 1 
r-i (mag) 




Figure A7. i?D2 — r plotted against r — i colour. The black dots are unre- 
solved sources in the D3 region of the CFHTLS (parameter flag_terapix 
equals unity, Coupon et al. 2009) with DEEP2 counterparts in the mag- 
nitude range 18 < Rd2 < 23 mag. The mean i?D2 — f and 1 sigma rms 
within r — i bins (0.2 mag size) are shown with the red circles and errorbars. 
The red line is the best linear fit to those datapoints. 



Figure A8. Galaxy number counts in i?D2 band. The red curves correspond 
to the 1 sigma rms envelope of the galaxy counts in the DEEP2 photometric 
survey of the EGS. These observations become incomplete fainter than R « 
24 mag resulting in the observed turnover at faint magnitudes. The blue dots 
are the inferred galaxy number counts in the CFHTLS r-band converted to 
the i?D2 filter using equation |A6] 



suggesting that at least to the first approximation the colour trans- 
formations |A4]|A5] are robust and can be used to convert the fluxes 
measured through the CFHTLS filterset to the DEEP2 BRI bands. 



APPENDIX B: THE 2SLAQ QSO-LRG SAMPLE 

The 2dF-SDSS LRG And QSO survey ( [Cannon et al][2006l l is a 
spectroscopic program that used the 2dF facility at the AAT tele- 
scope to follow Luminous Red Galaxies in the redshift range 0.35- 
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Figure A9. (B — R)r>2 and (R — I) 02 colour distributions in 0.04 mag bins of galaxies selected in the CFHTLS-D3 (blue solid line) and EGS DEEP2 (red 
hatched histogram) surveys. For CFHTLS-D3 sources the colours are estimated using the colour transformations |A4] | A5 1 The colour distribution is limited to 
galaxies in the two surveys with 18 < Rd2 < 24 mag. 



0.75 and UV-bright QSOs in two equatorial strips covering a total 
area of 180 deg 2 . The photometric selection of the LRG and QSO 
candidates used the SDSS photometry and the colour criteria de- 
scribed by Cannon et al. (2006 ) and Richards et al. (2005 ), respec- 
tively. 

We use data from the northern 2SLAQ strip, which includes 
9,923 photometrically selected LRGs and 448 spectroscopically 
confirmed QSOs in the redshift interval z = 0.35 - 0.75. The LRG 
sample is matched to the photometric redshift PDFs determined 
by |Cunha et alT] p009] > using the SDSS-DR7 photometry. The 
QSO/LRG cross-correlation function is then estimated using the 
methodology described in section [3] The results are plotted in Fig- 
ure [BT] For comparison also shown in that figure is the QSO/LRG 
cross-correlation signal using spectroscopic for both populations. 
For this calculation we use the 'Gold Sample' of 2SLAQ LRGs 
(total of 5,500 LRGs |Ross et al.||2007] >, which is the most rig- 
orously defined 2SLAQ LRG sample and has the highest spec- 
troscopic completeness. The fibre collision effect, due to the fact 
that 2SLAQ QSO had lower priority than 2SLAQ LRGs for spec- 
troscopic observations, is corrected following Mountri chas et al.| 
( |2009l >. 

Applyinga power law fit, on scales 1 — 10 Mpc, and using 
equationsm-p)] the QSO-LRG bias is estimated, b QL = 1.92to\\ 7 5 . 
The LRG bias is then estimated, using (i) the full photometric LRG 
sample 10, 000) and (ii) the resampling method described in 
section [3] (^ 6, 500 LRGs). The redshift distributions of the two 
samples are plotted in Figure \B2\ The inferred QSO bias is 6q = 
1.72tg;2I (full sample) and b Q = l.bSto.H (resampling method). 
These estimates are consistent within the uncertainties. 
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Figure Bl. QSO/LRG projected real space cross-correlation measurements. 
Open circles present the results when cross-correlating 448 spectroscopic 
2SLAQ QSOs with ~ 5, 500 spectroscopic LRGs, in the northern strip of 
the 2SLAQ survey. Filled circles plot the results when the spectroscopic 
LRGs have been replaced by ~ 10, 000 photometric LRGs. Filled circles 
have been offset in the horizontal direction by 51ogcr=+0.01 for clarity. 
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Figure B2. The redshift distribution of the 2SLAQ LRG samples used in 
the analysis. 



